Some issues on posterior predictive model checks for item response theory models ∗

نویسنده

  • Stefania Mignani
چکیده

Assessing the validity of model assumptions is a crucial part of any parametric statistical analysis. The question of interest is whether a particular specified model appears to provide an adequate fit. For item response theory models the goodness-of-fit problem has not been completely solved. The possible number of response patterns is large for even moderately long assessments, leading to sparse contingency table so the standard chi-square tests do not apply directly. In the last years several solutions have been developed to address this problem. Some of them considered limited information statistics but there is no universally accepted model fit measure (Cagnone and Mignani, 2007; Maydeu-Olivares and Joe, 2008). Furthemore, a number of researchers also recommended the use of a parametric bootstrap (Mavridis et al., 2007). Another alternative is the Bayesian approach, which provides not only an optimal estimation of the parameters but also the associated uncertainty. One well-known procedure for Bayesian model fit is the posterior predictive model checking (PPMC). The method was firstly developed by Rubin (1984) and was later extended to include general discrepancies by Gelman et al. (1996). The method compares the observed data with replicated data generated or predicted by the model using a number of diagnostic measures that are sensitive to model misfit. Any systematic differences between aspects of the observed data set and those of the replicated data sets indicate a failure of the model to explain those aspects of the data. The choice of discrepancy measures is crucial in the application of the PPMC method. Discrepancy measures should take into account characteristics of the model relating to the scientific purposes to which the inference will be applied and to measure aspects of the data not directly addressed by the probability model. The results of the chosen discrepancy measure can be investigated by a graphical approach. Graphical display is the most natural and easily tool to implement posterior predictive checks. If graphical displays do not suffice or are cumbersome, one can use a tail-area probability, also known as a posterior predictive p-value (PPP-value). Wu et al. (2014) proposed the relative entropy posterior predictive model checking (RE-PPMC) approach to complement the original procedure so that an arbitrary assertion of graphical comparison can be avoided. PPMC is a powerful and flexible tool and has many advantageous properties. By constructing the reference distribution empirically, PPMC overcome critical problems of traditional techniques. Furthermore, PPMC incorporates uncertainty of model parameter estimates into ∗Presented at the second internal meeting of the FIRB (“Futuro in ricerca” 2012) project “Mixture and latent variable models for causal-inference and analysis of socio-economic data”, Rome (IT), January 23-24, 2015

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Lognormal Model for Response Times on Test Items

A lognormal model for the response times of a person on a set of test items is investigated. The model has a parameter structure analogous to the twoparameter logistic response models in item response theory, with a parameter for the speed of each person as well as parameters for the time intensity and discriminating power of each item. It is shown how these parameters can be estimated by a Mar...

متن کامل

A Hierarchical IRT Model for Criterion-Referenced Measurement

A hierarchical IRT model is proposed for mastery classification in criterionreferenced measurement. In this model, items measuring the same criterion are grouped, and a difficulty and discrimination parameter of the criterion is estimated on the same scale as the person and item parameters. The level of proficiency of a student with respect to the criterion is determined by the probability of s...

متن کامل

A Hierarchical Latent Variable Model for Ordinal Data with \No Answer" Responses

An item response theory model for ordinal responses proposes that the probability of a particular response from a person on an speci c item is a function of latent person and question parameters and of cuto s for the ordinal response categories. This structure was incorporated into a Bayesian hierarchical model by Albert and Chib (1993). We extend their formulation by modeling \No Answer" respo...

متن کامل

Latent class models for testing monotonicity and invariant item ordering for polytomous items.

Two assumptions that are relevant to many applications using item response theory are the assumptions of monotonicity (M) and invariant item ordering (IIO). A latent class model is proposed for ordinal items with inequality constraints on the class-specific item means. This model is used as a tool for testing for violations of M and IIO. A Gibbs sampling scheme is used for estimating the model ...

متن کامل

Posterior Predictive Checks for Model Assessment in Occupancy Modeling

Though methodology and motivation for the use of posterior predictive checks has been introduced in some textbooks (e.g. Gelman et al. 2004), the technique is not commonplace in, for example, applied ecological studies using Bayesian techniques. In many studies using Bayesian techniques, the only semblance of model checking is the use of DIC for model selection, but this does not provide practi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015